Detecting Anomalies in Financial Data Using Machine Learning Algorithms
نویسندگان
چکیده
Bookkeeping data free of fraud and errors are a cornerstone legitimate business operations. The highly complex laborious work financial auditors calls for finding new solutions algorithms to ensure the correctness statements. Both supervised unsupervised machine learning (ML) techniques nowadays being successfully applied detect anomalies in data. In accounting, it is long-established problem misstatements deemed anomalous general ledger (GL) Currently, widely used such as random sampling manual assessment bookkeeping rules become challenging unreliable due increasing volumes unknown fraudulent patterns. To address risk audit inefficiency, we seven ML inclusive deep two isolation forest autoencoders. We trained evaluated our models on real-life GL dataset vectorization resolve journal entry size variability. evaluation results showed that best have high potential detecting predefined anomaly types well efficiently discern higher-risk entries. Based findings, discussed possible practical implications resulting accounting auditing contexts.
منابع مشابه
Machine learning algorithms for time series in financial markets
This research is related to the usefulness of different machine learning methods in forecasting time series on financial markets. The main issue in this field is that economic managers and scientific society are still longing for more accurate forecasting algorithms. Fulfilling this request leads to an increase in forecasting quality and, therefore, more profitability and efficiency. In this pa...
متن کاملA hybrid model based on machine learning and genetic algorithm for detecting fraud in financial statements
Financial statement fraud has increasingly become a serious problem for business, government, and investors. In fact, this threatens the reliability of capital markets, corporate heads, and even the audit profession. Auditors in particular face their apparent inability to detect large-scale fraud, and there are various ways to identify this problem. In order to identify this problem, the majori...
متن کاملMachine learning algorithms in air quality modeling
Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...
متن کاملSpatiotemporal Estimation of PM2.5 Concentration Using Remotely Sensed Data, Machine Learning, and Optimization Algorithms
PM 2.5 (particles <2.5 μm in aerodynamic diameter) can be measured by ground station data in urban areas, but the number of these stations and their geographical coverage is limited. Therefore, these data are not adequate for calculating concentrations of Pm2.5 over a large urban area. This study aims to use Aerosol Optical Depth (AOD) satellite images and meteorological data from 2014 to 2017 ...
متن کاملDetecting anomalies in data streams using statecharts
The environment around us is progressively equipped with various sensors, producing data continuously. The applications using these data face many challenges, such as data stream integration over an attribute (such as time) and knowledge extraction from raw data. In this paper we propose one approach to face those two challenges. First, data streams integration is performed using statecharts wh...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Systems
سال: 2022
ISSN: ['2079-8954']
DOI: https://doi.org/10.3390/systems10050130